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Abstract 

D. Wallace has presented a formal proof of the Born rule for Everettian Quantum Mechanics, 
based on a proposal by D. Deutsch using decision theory. We check his concepts and axioms for 
problems, suggesting possible corrections. His proof was reorganized and clarihed, with some 
flaws being identihed and improvements proposed. Parallels with classical decision theory are 
traced, to dispel the idea that Born weights play the role of Savage’s subjective probabilities. 
Unfortunately, many problems could not be fixed, possibly invalidating his result. 


1 Introduction 

Despite its successes. Quantum Mechanics still has unsolved conceptual problems, regarding 
measurements and the wavefunction collapse. Everett’s formulation |EI57l[DG73 |. also known 
as Many Worlds Interpretation, tries to solve them by rejecting the Measurement Postulate, 
and applying the rest of the usual formalism even to macroscopic systems. In this theory, the 
quantum state of an observer, after a measurement, is a superposition of different versions 
of himself, each correlated to one of the results. The wavefunction collapse is only apparent, 
due to the fact that each version is unaware of the others. This may seem far-fetched, but is a 
natural consequence of the quantum formalism minus the Measurement Postulate, and actually 
gives a clearer description of the measurement process than the Copenhagen Interpretation. 

But as this formalism solves old problems it creates new ones. The preferred basis problem 
is how to properly carve classical realities out of the quantum state of a macroscopic system. 
Decoherence has been proposed as a mechanism to achieve this. The probability problem 
regards a possible disagreement with experiments. Quantum measurements give probabilistic 
results, with chances given by the Born rule. As this rule is removed with the Measurement 
Postulate, and the rest of the formalism is deterministic, it is not clear how to explain the 
observed probabilities. 

To deal with this last problem, D. Deutsch |Deu99 | applied decision theory to the Everettian 
setting. A rational agent, following the Everettian formalism, must decide on bets involving 
quantum measurements. Deutsch tries to show that quantum symmetries make him decide as 
if results were probabilistic and followed the Born rule. This received some criticism, which led 
to increasingly refined arguments, culminating in D. Wallace’s formal proof |WallOI IWall2| . 
Critics remain unconvinced, questioning some axioms or the meaning of his result, but there 
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has been, so far, no criticism of the proof itself. Some supporters have proposed variations of 
Wallace’s ideas, but no attempts have been made to improve and clarify his formal presentation. 
We seek to fill such gaps, by giving a detailed analysis of his arguments and proof, showing 
their weaknesses and, when possible, proposing improvements. 

Section [2] reviews the measurement problem, the Everettian formalism, its problems, and 
some proposed solutions. It includes the results of our analysis |Manl7 | of Wallace’s solution 
to the preferred basis problem, which lays the foundation for concepts used in his proof of the 
Born rule. Rationality in an Everettian universe, and the relevance of branchings, are discussed 
in preparation for the analysis of some axioms. 

In section [S] Wallace’s concepts, axioms and proof are examined. We provide a new pre¬ 
sentation, in an attempt to clarify it, and point out problems that were found, proposing ideas 
on how to fix some of them. Parallels with classical decision theory are traced, to dismiss the 
idea that Savage’s subjective probabilities are at use. In section |4l this comparison is done in 
simplified examples, which can also help in grasping the essence of Wallace’s proof. Section [5] 
summarizes the problems found, and assesses the meaning of his result. [Appendix A| presents 
classical decision theory, for those unfamiliar with it, as it helps in following Wallace’s ideas. 


2 Preliminaries 

2.1 The measurement problem 

In the Copenhagen interpretation of Quantum Mechanics (CQM), the Measurement Postulate 
states that, if a system in a state 

i 

with {i I j) = 5ij and |cip = 1, is measured with respect to the basis {|*)}, the result will be 
one (and only one) of the i’s, with the state collapsing to the corresponding |i). Also, results 
are probabilistic, according to the following rule. 

Born Rule. The probability of result i is given by pi = Wi, where Wi is its Born weight, 

wi = \cif = \{i\iP)\\ (2) 

This postulate agrees with experimental data, but is conceptually ambiguous. It sets mea¬ 
surements apart from other quantum processes, which obey the deterministic linear Schrodinger 
equation, but lacks a precise definition of what are measurements. These might be distinguished 
for involving a classical macroscopic system, like an observer, but if this system’s particles obey 
Schrodinger’s equation, how can they collectively produce a nonlinear probabilistic^ process? 
And how does the collapse of the quantum state happen? Many attempts have been made 
to solve this measurement problem, such as hidden variables theories, Bohmian mechanics, 
nonlinear Schrodinger equations, and others, each with its own difficulties |WZ14I [AulOOI . 

This relates to the problem of whether Quantum Mechanics remains valid as systems get 
bigger, with Classical Mechanics emerging from it. In the usual view, quantum superpositions 
should not happen at the macroscopic level, lest we observe Schrodinger cats. But nothing 
in the quantum formalism seems to induce their disappearance in large systems, quite to the 
contrary. So many physicists consider CQM valid only for microscopic systems, with another 
theory being needed to explain the quantum-classical transition. This point of view becomes 
problematic as quantum phenomena are verified at increasingly larger scales, or for research 
in fields like quantum cosmology. Some see decoherence as an explanation for the emergence 

^Probabilistic even in principle, not simply due to lack of knowledge about the states of the particles. 
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of classicality, but it is questionable whether it eliminates superpositions, or merely wipes out 
interference between their components, which survive nonetheless. 


2.2 Everettian quantum mechanics (EQM) 

A solution, proposed by H. Everett III |EI57I|DG73| . rejects the Measurement Postulate, and 
applies the rest of the quantum formalism even to macroscopic systems. Evolution is always 
deterministic, even in measurements, following Schrodinger’s equation. It leads to macroscopic 
superpositions, but also explains why observers do not perceive them. If not for some unsolved 
problems, it might explain quantum measurements, and provide the missing link between 
quantum and classical mechanics. 

In EQM, a measurement is just entanglement of the measuring device with whatever is being 
measured. More precisely, a measuring device for a basis {|i)} of a system is any apparatus, 
in a quantum state |D), which interacts in such a way that, if the system is in state |i), the 
composite state evolves a^ 

|i) (g) \D} I—^ |i) 0 \Di) , (3) 

where \Di) is a new state of the device, registering result i. Linearity of Schrodinger’s equation 
implies that, if the system is in state 0 , the composite state evolves as 

l-i/)) 0 ID) = ( |i) I 0 |D) 1—^ ^Ci|i)0|Di). (4) 


This Hnal state is to be accepted as an actual quantum superposition of macroscopic states. 
But it will not be perceived as such by an observer looking at the device, as, by the same 
argument, his state \0) will evolve into a superposition, according to 


Ci |i) 0 \Di) j 0 \0) 


Ci \i) 0 \Di) 0 \Oi ), 


with \Oi) representing a state in which he saw result i. By linearity, each component |i) 0 |Di) 0 
\Oi) evolves independently, as if the others did not exist, as long as interference is negligible. 
In section [m we show why this assumption may be reasonable. 

Everett’s interpretation of this final state is that the observer has split into different versions 
of himself, each seeing a distinct result. Each version evolves as if the initial state had been 
|i) 0 \D) 0 |0), so he does not feel the splitting, nor the existence of his other versions. Each 
component is called a world or a branch, and this evolution of one world into a superposition of 
many is called a branching process. So in EQM all possible results of a measurement actually 
happen, but in different worlds. The observer in state \Oi) only thinks the system has collapsed 
into \i) because he does not see the whole picture, with all other results and versions of himself. 

In this language, (J2i Ci I*) ® \Di)) 0 |0) = Ci I*) ® \Di) 0 |0) could be read as saying 
there are identical copies of the observer, in different worlds, even before he sees the result. This 
seems to imply measurements cause him (and, ultimately, all Universe) to split instantly, before 
interactions have time to reach him, violating locality. Worse yet, the whole state is already split 
in branches before any measurement takes place, as Ci |i)) 0|D}0|O) = |i)0|D)0|O). 

And it may be split in different ways at the same time, depending on the decomposition basis 
used for \'ip). 

This weirdness is the fruit of bad terminology. We can only say something physical (not a 
mathematical artifact) happened to the device or observer once their states change. Instead of 
“worlds that split”, a more precise (but cumbersome) description might be “local macroscopic 

^For simplicity, we assume the system remains in state |i), but this is not necessary. 
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superpositions, which evolve with negligible interference, and gradually spread as new systems 
interact and get entangled with them”. 

Criticism that a proliferation of worlds violates conservation laws also results from a literal 
reading, as if branching required new matter and energy to build the worlds. Energy of a 
superposition is not the sum of energies of its components, and, since branching is governed 
by Schrodinger’s equation, it obeys all conservation laws. So the mean energy remains the 
same, and what really requires explanation is why we do not observe energy variations in some 
branches at the expense of others. 

Problems plaguing CQM disappear in EQM, but new ones come along, as the probability 
problem and the preferred basis problem, described in the next sections. Solving them would 
put EQM in a better theoretical standing than CQM, and might even reveal some testable 
difference between them. 


2.3 The probability problem 

In EQM, any result i with Ci 7 ^ 0 is obtained with certainty when measuring ([T]), even if only one 
version of the observer sees it. The probability problem is reconciling this with experiments, 
which indicate results are probabilistic and follow the Born rule. 

It has a qualitative aspect, how probabilities emerge from a deterministic theory. In classical 
mechanics, processes can appear random due to our ignorance of details, but in EQM one must 
explain randomness even if the quantum state and its evolution are perfectly known. Some 
authors |Vai98llSaul0l[SC16 | say there is a self-locating uncertainty in the time after the device 
measures the system, but before the observer sees the result. In their view, in this interval 
branching has already happened, but each version of the observer is still ignorant about his 
branch. But this assumes he splits before information about the results reaches him, which 
is a non-local (and arguably not valid) view of branching^, as pointed by Kent | Kenl5| and 
discussed in section m 

There is also the quantitative aspect of accounting for probability values, verett |EI57| 
proved that if a measure is attributed to branches, and is preserved by further branchings, it 
equals the Born weights ©. And, as the number of measurements tends to infinity, the total 
measure of branches with results deviating from the Born rule tends to 0. For finite experiments, 
this means branches with frequencies deviating beyond a given error have small measure. But 
this only makes them negligible if Born weights have a probabilistic interpretation, leading to 
a circular argument. A similar idea was proposed by Graham |Gra73| . with the same problem. 
Gleason’s theorem | Gle57| also implies the Born rule, if the probability of a branch does not 
depend on what other branches the decomposition basis has. Until we know how probabilities 
can emerge in EQM, we can not be sure they will have the properties required by Everett or 
Gleason. Of course, if they do not, it would mean EQM is incorrect. 

A counting measure, based on the idea that a measurement with n results gives n branches, 
may seem natural for EQM. It violates Gleason’s and Everett’s hypotheses, as the probability 
of a branch would depend on the number of branches in the basis, and it is not conserved 
if a new measurement is performed only in some branches. After many measurements, any 
sequence of results will appear in some branch. But the frequency of each result will tend to 
1/n in most branches, as if all results were equally probable, in disagreement with Born’s rule 
and quantum experiments. 

But it might not be possible to count branches. As, in EQM, measurements are no dif¬ 
ferent than other quantum processes, branchings can happen in all interactions, becoming a 
continuous and pervasive phenomenon. Throwing a die produces not 6 , but a large number 

®This does not apply to a similar idea in the cosmological interpretation of quantum mechanics lATlll . 
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of branches, resulting from interactions with the air, table, etc. This number might be chang¬ 
ing all the time, in unpredictable ways, and be unevenly distributed among the 6 results, as, 
for example, the number of black dots showing might affect the amount of branchings due 
to interaction with light. Most interactions involve few particles, so many branches will be 
macroscopically similar. A coarse-graining, as in section [2.4.31 might reduce and stabilize the 
number of branches, but it would be somewhat arbitrary, depending on the chosen fineness of 
grain. 

It makes no sense to literally prove Born’s rule in EQM, so by a Bom-like rule we mean 
any result explaining why, in an Everettian universe (i.e. one governed by EQM), quantum 
experiments would appear probabilistic, with probabilities given by the Born weights Other 
attempts |AL881 lHan03l [ZurOSlIBHZ06| have been made to obtain such result, but we focus on 
Deutsch and Wallace’s use of decision theory. 

2.3.1 The decision theoretic approach 

Decision theory ( [Appendix A[ ) aims to explain, in situations with probabilistic outcomes {de¬ 
cision under risk), how rational (in an axiomatically defined sense) decisions should be made. 
When probabilities are unknown {decision under uncertainty), it shows rational decisions can 
be described in terms of subjective probabilities. 

Deutsch |Deu99| proposed using it to show that, in an Everettian universe, a rational 
decision maker should decide on bets about results of quantum experiments as if outcomes 
were probabilistic, with Born weights playing the role of probabilities. He should behave as if 
such Universe were Copenhagean, with the following strategy: 

Born Strategy. Decisions follow the \Principle of Maximization of Expected Utilit^ with Born 
weights © in place of probabilities in the expected utilities dlB, i.e. 

EU = ^ Wi • u{ri). 

i 

It may seem reasonable to describe the illusion of chance in EQM in terms of subjective 
probabilities. But, contrary to a common misconception, this is not what Deutsch’s work is 
about, as we show in section [4] His probabilities are anything but subjective, with their values 
imposed b y the theor y. 

Critics [BCF'*~Oo1 IPriOGl IMal08| contested his assumptions, proposed reasonable decisions 
that do not follow the Born Strategy, or called into question the meaning of his result. In 
response, Wallace |Wal03l [Wal071 IWallOl IWall2| has made increasingly sophisticated attempts 
to clarify and formalize Deutsch’s idea. Many authors j SClGl IPolOll lAssllI IWill3l ICS13 | 
think this is on the right path, and variations have been proposed, in an attempt to clarify the 
situation. Critics remain unconvinced |Pril0llFin091IKenl0IIAlbl01IMaul4IIJanl6 |. questioning 
ideas behind Wallace’s axioms, or presenting examples of reasonable decisions contrary to his 
result. But most have focused on his informal presentation, and so far there has been no 
detailed criticism of his formal axioms and proof jWallOl IWall2| , which stand as the strongest 
defense of the decision theoretic approach. In section [3] we try to fill this gap. 

Proving a Born-like rule via decision theory requires worlds where narratives with agents 
and bets make sense. This involves solving first the problem we describe next. 

2.4 The preferred basis problem 

It consists in finding a natural way to decompose a macroscopic quantum state into branches 
behaving like the classical reality we observe (even if not all of them, and not all the time). We 

^ See [Appendix A] 
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focus on Wallace’s approach |Wall2| . based on decoherent histories. 

2.4.1 Decoherence 

Decoherence |Zur02l ISch071 |JZK+03| is a process by which an open quantum system loses 
some quantum characteristics, as it interacts and gets entangled with its environment. Many 
models show some states, called pointer states, are more robust with respect to such interaction, 
having a stronger tendency to remain disentangled. For example, in the quantum brownian 
motion, considered a good paradigm for a macroscopic system scattering particles from the 
environment, they are minimum-uncertainty Gaussian packets (coherent states) \x,p), forming 
an overcomplete set of vectors. 

As the environment is differently affected by distinct pointer states, and such differences 
spread across its many degrees of freedom, it rapidly evolves into (almost) orthogonal states. 
Off-diagonal elements {eoherences) of the reduced density matrix of the system, with respect 
to the pointer states, decay extremely fast. This (almost) eliminates interference between such 
states, and we say the system has deeohered. Close pointer states take longer to decohere, but 
this problem can be reduced by coarse graining, as we discuss in section 12.4.31 As the reduced 
density matrix becomes diagonal, it formally resembles a classical probabilistic mixture, and 
this is seen as an important step in obtaining classical behavior out of quantum systems. 

The reduced density matrix of an open system follows a master equation, and its dynamics 
lacks unitarity, which is central to Wallace’s arguments. So in describing the evolution of 
branches he turns to the consistent/decoherent histories formalisms, which apply to closed 
quantum systems (ultimately, to the whole Universe). 

2.4.2 Consistent Histories (CH) 

The consistent histories formalism |Gri841 [Gri02l [QmnSSl IOmn99 | identifies conditions allowing 
us to assign classical probabilities to sets of alternative histories, conceived as sequences of 
events or propositions about a quantum system. 

Its point of view is opposite to the Everettian one: quantum evolution is always stochastic, 
and quantum states are only tools for calculating probabilities in a set of possible evolutions, 
only one of which actually happens. Still, parts of the formalism adgrot well to the Everettian 
setting, if appropriately reinterpreted. We present a simplified versiorQ assuming a normalized 
pure initial state 'tpo at time to. 

A quantum sample space is an orthogonal projective decomposition of Hilbert space, a family 
{To,} of orthogonal projection operators with Pq, = 1 and PaPp = SapPa- It represents an 
exhaustive set of mutually exclusive events or propositions about the system. The Heisenberg 
picture is used, with Pa{t) = U{t,to)~^PaU{t,to), where U{t,to) is the unitary time evolution 
operator given by Schrodinger’s equation. 

A history space is a sequence {Pai (ti)},.. ■, {Pa„{tn)} of quantum sample spaces, at times 
to < ti < ... < tn- A history a = (ai,..., a^,..., a„) is a sequence of events, specifying one 
Pa^{tk) from each sample space. In each history space, satisfying a condition described below, 
only one history happens, with probability postulated by 

Pc = llV’all^ , (5) 

where the branch state vector i/iq, = Pa„ {tn) ■ ■ ■ Pax {ti)'ipo is considered as just a mathematical 
tool to calculate probabilities. 

A history space {Pa^i^k)} is a coarse-graining or coarsening of {Pa,,{tk)} if each Pa^{tk) 
is a sum of Pa^itkYs. Gonversely, {Pa,.{tk)} is a, fine-graining or refinement of Pa^{tk). This 

®Refiecting its stochastic point of view, CH is usually presented using density operators. 
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fits in the formalism by bundling history spaces into a history algebra, specified by taking, at 
each tk, a quantum event algebra, which is a Boolean algebr4j of orthogonal projectors. The 
Boolean condition implies projectors commute, so when their ranges have intersection {0} they 
are mutually orthogonal. 

To get a history algebra, we can start with a history space and take all coarsenings. The 
original projectors are minimal elements (atoms) generating the event algebras. Not all history 
algebras are formed this way, and some have no atoms. An algebra is atomic if it is generated 
by atoms. Wallace uses the term in such sense, but defines |Wall21 p.95] it in terms of countable 
generators. It is incorrect, as any countable algebra is generated by all its elements, even if it 
has no minimal ones. 

Given a coarsening {Pa,^{tk)} of {Pa^,{tk)}, each coarser history a can be seen as a family 
of finer ones. We write q € d if, for each k, the range of Pa^ (tk) contains that of Pc^, (tk)- CH 
requires the p^’s to behave as classical probabilities, so they must be additive, Pa = 

Hence there must be no interference between histories, and (ppa\'4>g) = 0 if cr 7 ^ /9. History 
spaces satisfying this are called consistent (relative to and are the only ones allowed 
in CH. Two consistent history spaces can be incompatible, in the sense that they can not be 
combined into a single consistent one. Either one can be used to describe a quantum process, 
but not both at once. 

A history space has a branching structure (relative to i/iq) if histories do not merge after 
diverging, i.e. if a and (3 satisfy ai ^ Pi and aj = Pj, for some i < j, one of them has zero 
probability. Consistency and branching are related as follows |Gri931IWaIT^ . 

Branching-Consistency Theorem. 0 Any history space with a branching structure is con¬ 
sistent, and the converse holds for some consistent refinement of it. 

The idea |Wall2l p.429] for the converse is that, since the fiia’s are orthogonal by consistency, 
a branching refinement can be constructed from projectors ■^\'ipa){'tpa:\. Wallace claims [Wall21 
p.96] this result extends to history algebras, but it seems wrong, as these projectors might not 
be in the original algebra. 

2.4.3 Decoherent Histories (DH) 

CH admits histories that are far from classical. The decoherent histories formalism |GMH90l 
ICMH93] combines CH and decoherence to get special history spaces with a more classical 
behavior. Besides (approximate) consistencjjj, it also seeks quasi-classicality, meaning histo¬ 
ries approximately follow classical equations of motion, interrupted at times (as in quantum 
measurements) by some quantum behavior. 

In DH, the space of relevant variables of the system of interest is partitioned in cells Ea, 
large in comparison with the coherence lenght (below which decoherence is not effective), yet 
small enough for the required precision. This determines a quantum sample space given by 
operators Pa ® lenv, where 

Ha = da:|®)(®|, ( 6 ) 

and lenv is the identity operator for the environment (taken to include the irrelevant variables 
of the system). A history a specifies, at a sequence of times tfc, in which Eq:^, the system if^. 

®With operations P\ A P2 = Pi W P2 = Pi + P2 — PlP2-, and ->P = 1 . — P. 

^The terminology varies. Wallace uses consistency for additivity of probabilities, and decoherence for non¬ 
interference. The concept of decoherence from section [ 2 . 4 . II is related but not equivalent to this one. 

®Branching-Decoherence Theorem, in Wallace’s terminology. 

^In Gell-Mann and Hartle’s terminology, medium decoherence. 

^^If necessary, different families of cells can be used at each time. 
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Coarsenings and refinements are obtained varying the cell size. 

For large enough cells, states in the range of different Pa’s are distinct enough to quickly 
get entangled to (almost) orthogonal states of the environment. Orthogonality tends to subsist, 
as the environment evolves, for its many degrees of freedom keep records of the history. For 
example, particles scattered by the system in different directions, at distinct Ec’s, tend to 
remain in (almost) orthogonal states, and affect other degrees of freedom in distinct ways. 
Erasing from the environment all traces of the history is impossible in practice. Of course, as 
different states of the environment evolve, they can spontaneously develop similar components, 
blurring their records, but their Born weights would be negligibly small. 

As records ensure (almost) consistency, (almost) additive probabilities can be assigned to 
histories, as in CH. Some interference can remain, but it is negligible if histories are coarse 
enough. Deviations in the additivity of probabilities should be irrelevant, as long as they are 
too small to be detected experimentally. 

The |Branching-Consistency Theorem| gives an (approximate) branching structure for a re¬ 
finement. But it will not be in terms of projectors Pa 0 lenv into smaller cells Ea, as it will 
require projecting environmental states onto the different records. 

Models show that, if histories are coarse enough, and the system has enough inertia to resist 
noise from the environment, histories with non-negligible probabilities will (approximately) 
follow classical equations of motion, with a stochastic force. 

So a (almost) consistent history space seems to emerge naturally in DH, and its non- 
negligible histories are quasi-classical. However, we can still have incompatible descriptions of 
the evolution, as shown by Dowker and Kent | DK96| . 

2.4.4 Non-Probabilistic Decoherent Histories (NPDH) 

Wallace uses DH to solve the preferred basis problem. But EQM is not stochastic, and all histo¬ 
ries happen, so probabilities should not be postulated, but obtained solving the probability 
problem. As his proof of the Born rule depends on branches provided by the first solution, it 
forms a circular argument |Bak07| . 

By Non-Probabilistic Decoherent Histories we mean the non-probabilistic parts of DH, rein¬ 
terpreted from an Everettian perspective. Approximations justified in DH on the basis of neg¬ 
ligible probabilities must be reexamined. Wallace has given | Wall2l p.253] non-probabilistic 
reasons for them, but, as we show in | Manl7 |. they do not seem valid, and NPDH has strange 
features: 

1. Absent a reason to disregard them, small amplitude branches are as relevant as large 
ones. This causes branch discontinuities: arbitrarily small changes in a quantum state 
can create lots of non negligible branches. Approximations used in decoherence can no 
longer be justified. 

2. All macroscopic histories happen and are equally relevant, no matter how erratic. Branches 
exhibit macroscopic quantum jumps all the time, with no macroscopic sense of causal¬ 
ity. Those allowing a meaningful macroscopic narrative are the exception, and even they 
keep sprouting weird subbranches. Records are unreliable, and tiny branches can suffer 
interference from much larger ones. An alternative view is that there is no real history, 
as all macroscopic states are always present. 

3. Even quasi-classical branches might not resemble our world, unless they have large Born 
weights. Without a Born-like rule, preferring these would be cherry picking those similar 
enough to our world to give results we want. So no premise can be accepted, and no 
possibility excluded, based on our physical experience. 












4. Orthogonality might not mean in EQM the same as in CQM. Measnrements (or branch 
decompositions) in almost orthogonal bases may be possible, and orthogonal states might 
not be mutnally exclusive upon measurement. An almost orthogonal set of branches can 
be linearly dependent, so, even allowing for coarsenings or refinements, branch decom¬ 
positions might not be unique. Two decompositions of the same state can involve quite 
different sets of branches. 

5. Decoherence does not really happen in terms of cells with precise boundaries, as pointer 
states \'iTx) tend to be like gaussian packets, peaked at a point x, but with small tails 
across all space. In DH it does not make much difference, as the tails give negligible 
probabilities. In NPDH it may be better to replace by the following projectors, which 
can give more stable branches, but are only almost orthogonal to each other: 

fa = (7) 

Once we have a Born-like rule, these problems should not matter, affecting only very unlikely 
branches. But in proving such rule they must be taken into account. 

2.4.5 Causal Histories 

A better solution to the preferred basis problem could be through the causal histories formalism 
proposed in |Manl7| . It gives Born weights a non-probabilistic role, as measures of resistance 
to interference, and causality. A history of tiny amplitude can be disregarded not for being 
unlikely, but for suffering so much interference that it becomes impossible to track causal 
relations between its events. 

Such formalism is in development, and many questions need answer to ensure it would 
work as expected. If it does, it would show Born weights have a role to play in EQM prior to 
any Born-like rule. And it should give well behaved branches, similar to our classical reality, 
providing an appropriate framework for Walace’s proof. 

2.5 Rationality 

If there is intelligent life in a branch, would it reason in ways we can comprehend? Would some 
kind of rationality even be possible? Wallace’s answer is yes. He claims |WallOI p.237] that, 
in an Everettian Universe, there is at least one rational way to decide, the |Born Strate^ and 
the aim of his proof is to show there is no other. But this argument is circular: that strategy 
is called rational because it satishes his axioms, while some of them are justihed assuming 
rationality is possible. 

In decision theory, rationality means compliance with certain axioms. These are chosen as 
rules intelligent people would agree should be followed, to have the best chance of achieving an 
objective (although, in practice, it is common to unwittingly disobey them). Philosophically, 
one might want to include the choice of objective in the analysis of rationality, but, for our 
purposes, a purely operational approach suffices. That the rules were chosen correctly is con- 
hrmed by the course of action they engender. For decision under risk, it can be shown, via law 
of large numbers, to be the one most likely to give best results in the long run. For decision 
under uncertainty, it depends on how good the agent’s estimations of probabilities are. 

To build an Everettian decision theory, Wallace chooses axioms he thinks an intelligent 
Everettian agent would follow. But, given the unknowns about an Everettian universe, even 
the best educated guess might be off. To test the axioms, one might check if an agent who 
abides by them tends to be better off, in the long term, than one who does not. But it depends 
on how we measure success, as there will be several versions of the agent, each with a different 
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set of results. Some counterexamples to Wallace’s result, like the distributive-justice or variety 
rules |Wall2l sec. 5.8], arise from lack of a consensual criterion for evaluating success in a 
branching Universe. 

In NPDH, it may not even make sense to talk about consequences of a decision. If all 
histories happen, all decisions lead to the same results: all possible ones. Even if a decision 
increases the Born weight of a result, without a Born-like rule this may be irrelevant. In an 
universe lacking macroscopic causality, there is probably no meaningful way to say a decision 
is rational. Or, what amounts to the same, we could say all decisions are rational. In any case, 
using such label to mean adherence to a set of axioms, which can not be justified a priori nor 
a posteriori, is pointless. 

2.6 Relevance of Branchings 

Some of Wallace’s axioms assume the agent does not care about branchings per se. The relative 
relevance of sets of branches, however he measures it, does not change just because a quantum 
experiment is performed in one set. This holds if he measures it via Born weights, as branchings 
preserve them, but not with branch counting. 

As seen in section to our knowledge there is no good way to count branches, as it 
depends on the decomposition basis, and varies wildly as branchings occur all the time. A 
coarse graining might give us a number, but it would be somewhat arbitrary, depending on the 
fineness of grain. And in NPDH it might be infinite. 

But failure of this crude idea of branch counting does not exhaust all possibilities. There 
might be a more sophisticated way to estimate the amount of branches, or another measure of 
relevance not preserved by branchings. We throw some ideas, just to show nothing in EQM 
seems to forbid it: 

• there may be an optimal interval of fineness, coarse enough to reduce interference and 
stabilize the number of branches, yet fine enough not to mix macroscopically distinct 
states. The average number in this interval might be a good measure; 

• a weighted average, giving smaller weights to the less stable branch numbers obtained on 
finer grainings, might result in a good number; 

• branches with tiny Born weights suffer much interference, so we might count only the 
more stable causal histories | Manl7 |: 

• even if we can not attribute a stable number to each set of branches, obtaining a stable 
ratio between them would suffice to measure their relative relevance; 

• if the wavefunction can be truncated below some cutoff, a measure could be given by the 
volumes of regions in which it is above the cutoff. If cells are small, this gives a stable 
way to compare ratios of their numbers. 

• the agent might have some qualitative way to compare branching structures and say one 
is richer than other, or that quantum measurements make it more ramified. 

Even if these ideas fail as well, the point is we can not say a branching sensitive measure 
is impossible just because we do not know any. Not proving the relevance of branchings is not 
the same as proving their irrelevance. 


3 Wallace’s formal proof of the Born rule 

Wallace’s latest attempt |WallOI IWall2| to formalize Deutsch’s ideas begins with an informal 
presentation of the ideas, and justification for the axioms. It is followed by formal definitions 
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and proof, but at times they bear little resemblance to his explanations, not translating correctly 
the ideas discussed. Also, expressing quantum ideas in a language closer to Decision Theory 
has taken its toll, making the notation clumsier, and letting concepts seem less troublesome 
than they are. 

We provide a new presentation of the proof, criticism of its problems, and suggestions to 
improve it. Although we follow the same general lines, changes were made, which, in our 
opinion, clarify it without affecting the result. We stick to the usual quantum notation, to 
remove a layer of complexity, and to avoid accepting a result just because it sounds reasonable 
in a decision theoretic terminology. 

3.1 Basic concepts 

A quantum decision problem is specified by: 

• A separable Hilbert space H. 

• A collection £ of closecF^ subspaces of "H, with H G £, forming a complete Boolean algebra 
under countable operations of A, V, and J-F^ 

• A finit^3 partition TZ oiH (a partition of S G f is a set of mutually orthogonal elements 
of £ whose disjunction is E). 

• A subset M <Z £ such that any E G £ has a partition in elements of Ai. 

• For each E G £, a. set Ue oi unitary operators from E into 

A solution is given, for each state '0 of each M € Af, by a preference order on Um, 
satisfying axioms given in section [3^ From the symbols and are dehned 

as usual. 

Notation. To keep notations simple and consistent, we adopt the convention that: 

• tp, 4>, ip are always used for states; 

• M, N for elements of A4; 

• r, s, t for elements of 7Z; 

• E, F for elements of £■, 

• U, V, W, X for elements of some Ue- 

• Ou represents the intersection of &\\ E G £ containing the range of U. 

• Hb is the orthogonal projector onto E. 

• indices i,j run over countable index sets. 

Justifications for the axioms depend on how £, TZ, M or Ue are interpreted. Wallace 
hints at how he conceives of them, but important details are left out, allowing for conflicting 
interpretations. He describes |Wall2l p.l63] a quantum decision problem as one in which, in an 
Everettian Universe, a system prepared in some initial state is to be measured in some basis. 
Bets are available, giving, in each branch, a certain payoff, depending on the result in that 
branch. And an Everettian agent (someone who knows EQM correctly describes his Universe, 
and knows the Born weights of that state in the measurement basis) has to decide which bets 
he prefers. 

^^Wallace omits this condition, which is necessary to avoid some difficulties, like (i ?^ E. 

^^See section [3. 1.1 1 for definitions. 

^®This condition is added for simplicity. See section [STiS] for a discussion. 

^^Wallace’s definition includes some conditions, which we placed with the Richness Axioms. 
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In this setting, T-L must be the Hilbert space of the total system of interest (agent, measuring 
device, payoffs, etc.), plus its environment. Ultimately, it must be the Hilbert space of the 
whole Universe, as for open quantum systems evolution is non-unitary, which would invalidate 
Wallace’s proof. 

3.1.1 Events 

An element of £ is called an event, and intuitively it is the subspace spanned by all states 
satisfying some proposition. For example, we could have an event E spanned by all states in 
which a spin measurement resulted up and the agent received $10. 

Operators A, V, _L play in Quantum Logic |BVN361 IEGL09| roles similar to the connectives 
AND, OR, NOT of Classical Logic. A conjunction AiEi is the intersection of subspaces Ei, a 
disjunction ViEi is the closure of the span of their union, and E^ is the orthogonal complement 
of E. We also write E E F meaning E and F are orthogonal, and V for disjunctions of 
orthogonal subspaces. An important difference between quantum and classical (Boolean) logics 
is that the distributive law fails. For example, if Su, Sd and Sh are eigenspaces of spin up, 
down, and in some horizontal direction, then ShA{SuVSd) = Sh but (5/iAS„)V(S/iA5'd) = {0}. 

By requiring T to be a Boolean algebr Wallace is forcing a classical structure on it, in 
the same spirit of CH. In fact, it is equivalent to the quantum event algebras. He uses the 
same algebra at all times, as for him events represent macroproperties, which emerge naturally 
in EQM via decoherence, as in DH. Distributivity implies E = {E A F) V {E A F^), so the 
Boolean condition requires the following one. 

Orthogonality Condition. Conjunction of two events is zero if, and only if, they are orthog¬ 
onal, i.e. 

E A F = {0} E EE. 

This makes conjunctions correspond to orthogonal projections, E AF = HfF. 

Events E and F are disjoint or mutually exclusive if, whenever a state of E is tested for 
the condition of F, the result is false, and vice versa. Classically, it means no state has both 
conditions, and is equivalent to FnF = 0. In CQM this is replaced hy E E F, since measuring 
the property of F in a state nonorthogonal to it can result true. In EQM, it will definitely 
result true in some branch. The [Orthogonality ConditionI makes quantum disjointness similar 
to the classical one. 

To partition an event is to decompose it in mutually exclusive subevents. Given partitions 
{Fi} and {Fj} of the same event, {Fj} is a refinement of {Ei}, and {Ei} is a coarsening of 
{Fj}, if each Ei admits a partition in terms of F^ ’s. With the [Orthogonality ConditionI two 
partitions of an event always have a common refinement. 

As stated in section 12.4.41 orthogonality might not play in EQM its usual role. If almost 
orthogonal measurements turn out to be possible, F T F would no longer ensure mutual 
exclusivity, and, in fact, there would be no mutually exclusive events. Likewise, use of T as a 
negation operator might not be valid. 

The requirement that £ be complete is a technical condition, that every subset of the 
algebra has a supremum, necessary for operating with infinitely many elements. It is not really 
restrictive, as any Boolean algebra admits a unique completion. 

^®In [Wall2] . Wallace gives, on pp. 152 and 435, good definitions of the Boolean condition. But on pp. 95 and 
175 there are imprecise characterizations, which make it seem less restrictive than it is. 
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3.1.2 Macrostates 


Elements of At are called macrostates. Given a partition of H into macrostates Mi, 'Xp has 
a branch decomposition with branches ipi € Mi if tp = 'YP.i'^i- The |Orthogonality ConditionI 
implies two branch decompositions of tp admit a common refinement. As any E has a partition 
in macrostates, events are disjunctions of macrostates satisfying some condition, and xp £ E if 
it is decomposable in branches having such condition. 

So far, the definition of At is too flexible. It can be any subset of £ which generates it 
via orthogonal disjunctions, even Ji4 — £. Or it can be any set of orthogonal subspaces whose 
disjunction is E, with £ being the Boolean algebra it generates. Wallace mentions these cases 
| Wall2l p.l76], but his axioms can not be justified in such artificial examples, and a more 
concrete characterization of At is needed. 

He states |Wall2l p.l64] that “the choice of macrostates is largely fixed by decoherence, 
although the precise fineness of grain of the decomposition is underspecified”. They should 
not be too coarse, so that “an agent can be assumed not to care exactly what the microstate 
is within a given macrostate”. Also, “an agent can have no practical control as to what state 
she gets, within a particular macrostate, on familiar statistical-mechanics and decoherence 
grounds” |Wall2l p.170]. 

It seems he has in mind the ranges of operators Pa ® lenv, used in DH. Different families 
of cells Eq, can be used, to allow for refinements or coarsenings. Cells should be large enough 
for decoherence, yet small enough that the agent is indifferent to points in the same cell. So 
states in the same M £ M should differ, for the system of interest, only on microscopic details. 
There can be macroscopic differences in the environment, but we must assume they are beyond 
the agent’s control and he does not care about them. So each macrostate determines the state 
of the system of interest at the macroscopic level, playing the role of a classical state. 

For the |Irreversibilit^ axiom, it would be necessary to further refine macrostates by parti¬ 
tioning the Hilbert space of the environment according to all possible history records. So each 
macrostate should specify not only the macroscopic state of the system of interest, but also the 
specific record of the history that led to such state. 

If the Pq’s are as in ®, the [Orthogonality ConditionI holds, but, as stated in section [2.4.41 
macrostates might be highly unstable. This makes them bad models for classical states, and 
Wallace’s axioms hardly justifiable. With 0, we get more stability, but at the cost of that 
condition, and the Boolean one, being satisfied only approximately, with all consequences 
entailed by almost orthogonality. 

Wallace describes |Wall21 p.l65] an £ formed by functions supported in open sets of E,^. 
He says it can not be generated by macrostates, relating it to atomless history spaces |Wall21 
p.95], to show why he would not ask that all £ be constructible from A4. This is confusing, 
as his definitions do require it. In any case, the example is wrong: as the topology of 
is second countable, £ is generated by a countable subset of events, which can be taken as 
M. The problem here seems to be the confusion, seen in section [2.4.21 between atoms and 
generators. 

3.1.3 Rewards 

Elements of TZ are events called rewards. Wallace says they “represent payoffs an agent could 
get” |Wall2l p.l75], and are “. .. a coarse-graining of the macrostate subspaces ... such that an 
agent’s only preference is to which reward subspace she is in” | Wall2l p. 165]. So there seems 
to be two distinct characterizations of TZ: 

Rewards as Payoffs. To each payoff of a bet there corresponds some r £TZ, consisting of the 
disjunetion of all macrostates in which the agent received that payoff. 
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Rewards as Preferences. Rewards should be set up in such a way that the agent will be 
indifferent between two states of the same r. 

Wallace alternates between these, but they may be incompatible. Suppose some bets give a 
$10 prize, and consider, for example, that receiving it in one that takes a lot of time and effort 
may be less preferable than in a coin flip. By [Rewards as Payoffs) we should have a single re¬ 
ward subspace corresponding to this prize, which we label as r =“$10”. [Rewards as Preferences! 
would require several: ri =“$10 on a quick bet”, r 2 =“$10 on a bet requiring extra experi¬ 
ments”, etc. 

The source of confusion might be in the use of the term “payoff”. Wallace says |Wall21 
p.l67] that in his decision problems “measurements are being made and payments are being 
provided”, with “envelopes of cash”. As this is meant to justify his [Reward Availability [ axiom, 
it seems reward subspaces correspond to such payments. In other occasions he also uses payoffs 
as meaning cash or other tangible prizes. This leads to [Rewards as Payoffs) with rewards and 
payoffs equated with such prizes. 

But in classical decision theory ( [Appendix A[ ) payoffs or rewards refer not only to prizes 
awarded, but can cover other relevant factors, such as the price paid to take part in the lottery, 
the effort or time spent in the betting process, and so on. A big achievement of the theory was 
replacing the monetary value of the prize by a more flexible utility value, which can take these 
factors into account. If we took payoffs to have such broader meaning, covering all factors the 
agent might care about, the two characterizations would coincide. Classically this is not so 
hard, if we have a good intuition about the problem. But in the Everettian case it is not easy 
to decide what can be relevant, as shown in section mi for branchings. 

As we can not be sure to know all an Everettian agent might care about, it is better to 
keep using payoffs to mean cash prizes. Including other factors, and assuming the agent takes 
only them into account, risks neglecting things like branchings or erasures (see section [3.2.11) 
before we know it is reasonable to do so. It is safer to admit the characterizations of 72. are 
inequivalent, and keep that which better suits our purposes. [Rewards as Payoffs) is easier to 
implement. Not knowing a priori what the agent cares about, we would not be able, under 
[Rewards as Preferences! to set 72 as an input for the problem. We might try to define it a 
posteriori, in terms of This may have been Wallace’s initial idea, as he says that reward 
subspace is a derived concept f |WallO[ p.234], removed in [Wall2 ['l. But this would lead to 
difficulties with axioms refering to 72, as statements about it could depend on properties of 
we are yet to deduce. 

If, under [Rewards as Preferences! we take rewards to be sets of equally preferred states, it 
is not clear why they should be subspaces, closed, or mutually orthogonal. The variety rule, 
proposed by Adam Elga (' [Wall2! p.l93], also discussed in [ Maul4[ l. is an example that a 
superposition of states might be preferable to each. 

Wallace’s description of 72 as a coarse-graining of M suggests that any M will be in some r. 
But his example oi M = S, and the use of A r in the statement of [Macrostate Indifferent 
indicate otherwise. If the [Orthogonality Conditioii| is valid then M = \j A r, so an M 
not contained in any r intercepts, non-trivially, at least two rewards ri and r 2 . In case of 
[Rewards as Payoffs) having different p ayoffs in M goes against the idea that its states are 
similar. With [Rewards as Preferences! it contradicts the idea that the agent should not have 
preferences between states of M. 

Our condition that 72 be finite is just to avoid some technicalities and focus on the main 
parts of Wallace’s proof. It does not seem like an important loss of generality, and if necessary 
the proof can be adapted to work with infinitely many rewards. 
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3.1.4 Acts 


Elements of Ue are acts available at E. Intuitivelly, an act might be, for example, preparation 
of a quantum state, its measurement, placing a bet, receiving a payoff, or any other actions of 
interest. Availability of an act depends on E, e.g. the act of deciding a bet will be available 
only at events in which that bet has been placed. 

In EQM, even macroscopic evolutions are described by unitary operators. But this assumes 
closed systems, so each act U describes not only actions related to the decision problem, but 
also everything else happening in the environment (ultimately, the whole Universe). This can 
include extraneous facts, like the agent’s favorite team winning or losing a game, which could 
alter his preference for one U over another. Perhaps one could select for comparison only acts 
which do not differ significantly on their effects on the environment, but it is not clear how to 
do so in practice. 

The range U{E) might not be in £, so we use Ou, the smallest event containing it. For 
Ip £ E, a, partition Ou = Vi Mi gives a branch decomposition of Ui/’ in the Mi’s. If the 
IQrthogonality Condition] is valid, these are the only branches resulting from the act, up to 
coarsenings or rehnements. 

At a branched state there are many versions of the agent, each acting on his own branch 
Mi. In EQM their individual acts Ui € Um^ are restrictions of some U G lAij.M-- In Wallace’s 
terminology, they form a compatible act function. Being the Mi’s mutually orthogonal, so must 
be their images U{Mi). So acts in different branches are linked, threatening the independence 
of the agent’s versions. But as evolution of the Mi’s tends to preserve macroscopic differences, 
and the environment keeps (approximate) records of the macroscopic history, the images of 
the f/i’s tend to be almost orthogonal. Small adjustments might be enough to get perfect 
orthogonality. However, in NPDH such adjustments could cause the creation or elimination of 
lots of branches which, despite their tiny Born weights, can not be considered irrelevant. 

Wallace says contemplatahle might be a better term than available. This distinction is 
important, allowing us to exclude irrelevant acts, and include some which are not feasible, but 
the agent might want to consider as if they were possible. Inclusion of physically impossible 
acts is actually unavoidable. As EQM is deterministic, there is always only one possible act. 
Even the agent’s choices are predetermined, as the act of choosing might at best correspond 
to branching into all possible decisions. Of course, determinism in Classical Mechanics never 
kept anyone from worrying about decisions, which can be considered abstractly, as if the agent 
could choose what to do. Classically, that is easy, as the agent can be treated as separated from 
the process. In EQM this might not be so simple, for if we want to consider how branchings 
affect him, he must be included in the deterministic dynamics of the system. 

We note that the only acts used in Wallace’s proof are combinations of identity acts, reward 
acts, branching acts, and erasures (see section 13.2.1 1 for definitions). 

3.2 Axioms and further definitions 

Wallace’s axioms are organized in two sets: richness axioms, which are conditions on the sets 
IAe giving the agent a good selection of acts to use, and rationality axioms, which are conditions 
his preference orders must satisfy to be considered rational. 

3.2.1 Richness axioms 
Indolence. 1e £ He for any E £ £. 

Restriction. Let E, F £ S with F C E. If U £ IAe then U\f£ IAe. 

Composition. Let E £ £. If U £ IAe and V £ lAou then VU £ IAe. 
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These are straightforward. Wallace’s Continuation and Indolence axioms were replaced by 
a simpler one, as events with Ue = 0 play no role in the decision problem. 

Irreversibility. Let E, F € £. If E L F then Oif\^ _L Ou\p, for any U £ IAe\/f- 

Wallace uses C?( 71 gACl( 7 Ip = {0} instead of Oyip _L Oul f ; that is needed for the |Equivalence Lemma[ 
If the lOrthogonality ConditionI is valid these are equivalent. He also omits the necessary hy¬ 
pothesis ELF, without which we could even have E = F. 

By unitarity U{E) _L U{F), so the axiom means orthogonality is not lost when extending 
to the smallest events containing these ranges. This depends on how £ is set. With the 
[Orthogonality Condition) T Ou\p means states of U{E) and U{F) have no common 

branches, i.e. components in the same macrostate. The idea is of a branching structure: 
evolution of distinct branches can not generate a common one. 

At first, Wallace’s only argument in this direction is that an agent in a specific branch can 
not cause branches to merge |Wall2l p.l64]. But, even if this can not be done on purpose, 
it is conceivable that tiny violations of irreversibility might happen naturally. In fact, under 
NPDH, they seem to happen all the time. We may even have large violations, for macrostates 
corresponding to adjacent cells. 

Later he discusses irreversibility at lenght |Wall21 ch.9], tracing parallels with the Second 
Law of Thermodinamics. But at that point he assumes the Born rule as proven |Wall2l p.333], 
so his conclusions can not justify this axiom. Note that most physical laws are reversible, 
the Measurement Postulate and the Second Law being notable exceptions. Both are poorly 
understood, and might be linked somehow. In throwing away the first, we may have invalidated 
the second in EQM. In any case, the Second Law would not prevent branches from merging, 
it would only enact a cost for this, in terms of an increase in the entropy of the environmental 
variables. 

He also appeals to the [Branching-Consistency Theorem| [Wall2l p.334]. But it does not 
apply to history algebras, so the branching structure might not be in terms of macrostates, 
unless these are defined separating history records in the environment. And decoherence gives 
almost consistency, so the branching would be approximate. 

This axiom is only used in the [Equivalence Lemma[ where small violations of irreversibility 
would result in UiUitpi = f/ 2 C/ 2 V ’2 being satisfied only approximately. If violations are small 
enough, it might still be possible to use State Supervenience[ in that lemma, combined with 


[Problem Continuity[ and [Solution Continuity 


Problem Continuity. Let E G £. Then Ue is an open subset of the set of unitary operators 
from E into TL, in the operator norm topology. 

So, if U is available, all acts sufficiently close to it are also available. Wallace’s justification 
is physical imprecision, as the agent can not control every microscopic detail of his actions. But 
even small perturbations of U can lead to the appearance of many new branches, which, as seen 
in section [2.4.41 can not be neglected despite their tiny amplitudes. This axiom is only used in 
the [Utility Lemma] with [Standard Actsl and can be discarded if these can be constructed in a 
continuous way. 

We replaced Wallace’s Reward Availability, Branching Availability, and Erasure axioms 
by versions restricted to macrostates, and added another. Compatibility, to extend them to 
branched states. The reason is that he justifies them only for single branches, and the passage 
to multiple ones, which was left implicit, is not trivial. 

Reward Availability. For any M £ M and r GlZ, there is U G Um such that Ou C r. 

Wallace’s justification for the availability of such reward acts is that “envelopes of cash can 
always be given to people” [Wall21 p.l67], implying[Rewards as Payoffs[ Under [Rewards as Preferences! 
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it might not be so easy to produce such acts. 


Branching Availability. Let r £ TZ, M £ Ai with M C r, and 'ip £ M. Given positive 
numbers {pi} with = 1, there are U £ 14m, with Ou C r, and a partition Ou = \/iMi, 

with Mi £ M, such that = pi ■ 


This is explained by the possibility of preparing and measuring a quantum state to cause 
such branching act. If [Rewards as Payoffs is valid, this leads to a state in the same r, as no 
payoff is affected. With [Rewards as Preferences! it is not clear why we can expect Ou C r, 
unless we are sure the agent does not care about branchings. 


Erasure. Let r £ TZ and M,N £ A4 with M,N C r. If p) £ M and (p £ N, with \ijj\ = \ip\, 
there are U £ 14m and V £ 14n such that Ou, Ov C r and Lfp! = Vp. 


The idea of such erasure acts is that, once the agent receives the payoff, he can erase all 
information of how he got it (e.g., through which result, of which bet). But, as he is part of 
the quantum system, it would entail wiping out his memory of the bet. The same holds for 
the environment, which was supposed to keep unerasable records. 

As erasures do not alter payoffs, the final state remains in r, if Rewards as Payoffs| is valid. 
It is not clear why that would happen with [Rewards as Preference 

Wallace describes U and V as taking ip and p into an erasure subspace Cr, “whose states 
correspond to the agent throwing the preparation system away after receiving the payoff but 
without recording the actual result of the measuremen” |Wall2l p.l67]. To explain why Uip 
and Vp are equal, he says that, as the agent 


... lacks the fine control to know which act he is performing, all erasures should be 
counted as available if any are. It follows that, since for any two such agents all 
erasures are available, in particular there will be two erasures available satisfying 
the axiom. | Wall21 p.167] 

He seems to mean that, if an erasure takes ip to some p £ Cr, there will be others taking it 
to any other p £ Cr, as the agent has no precise control of the final microstate. If that was 
the case then we could get Up = Vp. But the argument fails if Cr is not a single macrostate. 
Even if it is, not being able to take p precisely to p does not mean it can be taken to any 
p. Information can be erased at a macroscopic level, but supposedly subsists in environmental 
records. Unless these are corrupted, as in NPDH, the sets of states to which p and p can be 
taken will not overlap. 

This axiom is only used in the [Equivalence Lemma| combined with [State Supervenience[ 
Using [Solution Continui^ one might avoid the need for prefect equality, so it would be enough 
to get Up and Vp sufficiently close to each other. 


Compatibility. Acts Ui £ IAmu obtained on mutually orthogonal macrostates Mi via \Indolence[ 
\Reward Availability[ \Branching Availability] or Wrasurel can be chosen such that Ui = U\Mi for 
some U £ U-i/.M- ■ 

As seen in section [3.1.41 acts on different branches are not independent, since their images 
must be orthogonal, but small adjustments may be enough to obtain such condition. The axiom 
assumes this can always be done for identity acts, reward acts, branching acts, or erasures, so 
they can be combined into a single act U. 

[Problem Continuity[ may allow such adjustments, but we must question if they alter impor¬ 
tant characteristics of those acts. Are small errors in the pPs relevant in Branching Availability? 
Can the condition Up = Vip in Erasure be approximate? Also, in NPDH even small changes 
can bring about lots of new branches. 
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The axiom does not work for the pair of acts U, V from one application of [Erasure] as the 
arguments for almost orthogonality of the images fail. And suppose U and V were restrictions 
of a single act. As they send a couple of states from two macrostates into Cr, it is reasonable 
to assume the whole macrostates end up in Cr. If Cr is a single macrostate (which, as seen, 
may be necessary in the argument for Erasure), this would contradict jlrreversibility] To be 
fair, this axiom is never used in the proof to combine erasures from the same pair, only from 
different applications of Erasure, and Wallace’s original formulation of that axiom reflects this. 
But this shows once more how erasures seem endowed with unreasonable capabilities. 

3.2.2 Rationality axioms 

Ordering. For each M £ M and ip £ M, is a total order on Um ■ 

This corresponds to [Completeness] and [Transitivity^ from classical decision theory. 

Act Nondegeneracy. There are M G M, ip £ M and U,V £ Um such that U V. 

By this axiom, the agent has a strict preference in at least one case. Wallace omits it, but 
assumes a consequence, the [Reward Nondegeneracy Lemma] saying when it is not valid his 
result holds trivially. Although that is true, we prefer to obtain the lemma from this axiom, 
since the lOrder on Rewardsl is derived from the order on acts. 

Reward Preservation. Let r £ TZ, M £ M with M C r, ip £ M, and U £ Um ■ If Uip £ r 
then U 1m. 

Wallace calls this axiom Branching Indifference, justifying it only for branching acts. But 
his formal statement implies the agent is indifferent to all acts preserving r, irrespective of 
their nature. This reflects [Rewards as Preferences! which, as seen in section 13.1.31 can not be 
assumed from the start, and does not serve as justification. 

It is used only with branching acts and erasures, requiring [Rewards as Payoffs] to ensure 
Uip £ r. We replace it by a new version of Branching Indifference, plus Erasure Indifference, 
which, being more specialized, might be easier to justify. 

Branching Indifference. Let r £ IZ, M £ M with M C r, and ip £ M. If U £ Um is a 
branching act then U 1m. 

This is closer to Wallace’s idea, that “an agent doesn’t care about branching per se: if a 
certain operation leaves his future selves in N different macrostates but doesn’t change any 
of their rewards, he is indifferent as to whether or not the operation is performed” [ Wall21 
p.l70]. That this may seem reasonable is due to the ambiguity of the term rewards. As seen in 
[Branching Avail ability | if [Rewards as P ayoffs] is valid then Ou C r, but preferences may vary 
inside r. Under [Rewards as Preferences! the agent is indifferent to acts preserving r, but a 
branching act may have Ou r. 

Why would the agent care about U, if it does not alter payoffs? Maybe for banal reasons, 
like the work or time spent to perform the extra experiment. Or he may think, as in section 
[m that U alters the relevance of the sets of branches with each payoff. 

Wallace also says that “a preference order which is not indifferent to branching per se 
would in practice be impossible to act on: branching is uncontrollable and ever-present in an 
Everettian universe” [ Wall21 p.l70]. This seems at odds with his view that the wild proliferation 
of branches is tamed by coarse graining. Anyway, as seen in section 12.51 it may be impossible 
to act rationally under Everettian conditions. 

Erasure Indifference. Let r £ IZ, M £ M with M C r, and ip G M. IfU£ Um is an erasure 
then U 1m. 
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It is not clear why the agent would be indifferent to erasnres, which may require a lot of 
work to erase all records of the bet from the environment, and even his own memory. That 
they do not alter payoffs is not enough, for the same reasons as above. 

Erasures are used in the |Equivalence Lemma| with |State Supervenienc^ to ensure the agent 
is indifferent between bets taking him to the same r £TZ. Snppose a betting process is enjoyable, 
while another is unpleasant, but they are otherwise equivalent. The agent might prefer not to 
include erasures in his decision, as they might lead him to choose an awful course of action, or 
forego a nice one, only because in the end he will forget it. Or, having suffered through a bad 
bet, he might in the end prefer to use an erasure and forget about it. In any case, he is not 
indifferent to erasures. 


State Supervenience. Let M, M' € M, -tp £ M, ip' € M', U,V £ Um and U', V' £ Um' 
Utp = U'lp' and Vtp = V'tp' then U V ^ U' V'. 


If 


So preferences can not depend on the acts or initial states, only on the final ones. In 
particular, Uip = Vip implies U V. Wallace argues | WalI2l p.l71] that U and V differ only 
on states the agent should not care about, for not being the actual one. But, he notes, this 
could be a limitation of the formalism, which does not detail the evolution process, and DH 
might give us more flexibility. But he says the agent would have to be indifferent “to the size 
of the temporal gaps between history projectors”, and as “we consider sequences of decisions 
made only over very short periods of time”, this “entails that acts can be represented by single 
unitary transformations”. 

This is not convincing. Bets are not so quick that nothing relevant can happen in the 
process. In Wallace’s proof they have intermediary steps, with branching acts and erasures, 
which we are not sure to be irrelevant. Even if there is no natural choice of temporal gap, this 
should not keep us from placing quantum sample spaces at steps we deem relevant. For example, 
the agent might prefer a bet with a single quantum measurement than another requiring two, 
as the first is quicker and takes less work, even if they lead to the same payoffs, with the same 
Born weights. This can easily be considered in the decision by adding one intermediary step 
in the histories. Another possibility is using time dependent evolution operators U(t) to detail 
the process. This would be an idealization, as no agent can compute these, but the same holds 
for any act U describing the evolution of a macroscopic system. 

Wallace does not explain why the initial state is irrelevant. Suppose the agent has a choice of 
two acts, one giving a $10,000 payoff, the other leading to a superposition of $0 and $1,000,000 
branches. It is conceivable that his preference might depend on whether he had initially $0 
(and could find the certainty of $10,000 very tempting) or $1,000,000 (in which case he might 
consider $10,000 as insignificant as $0). 

An argument could be that the final state includes environmental records and the agent’s 
memories of the bet and initial state. But this fails in NPDH, where records are unreliable, 
and with lErasurel which destroys memories and records. This axiom is used after erasures, in 
the [Equivalence Lemm^ 

In classical decision theory, lotteries m are set in terms of rewards and probabilities, with no 
description of the process or initial state. This simplification is usually harmless, as our intuition 
allows us to adjust the model, including factors like time spent or effort in the description of 
rewards. In the Everettian case we can not count on intuition, and, as discussed in section 
13.1.31 we can not include, in the definition of rewards, all relevant factors, as we do not know 
what these are. 


Definition (Accessible States). Let E £ £. A (p £ E is accessible (from ip, via U) if there are 
M £ A4, Ip £ M and U £ Um such that Ou C E and Uip = p. 

So a state is accessible if it can be reached from a non-branched state, via some act. 
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Non-branched states are trivially accessible, and, assuming the decision problem starts at a 
macrostate, all states of interest should be accessible. We extend which was only dehned 
at non-branched states, to accessible onef^. 


Definition (Extended Order). Let tp £ E {E £ £) he accessible from 'ip £ M {M £ A4) via 
U £ Um- For V, V' £ IAe, we write V V' when VE V'U. 


By [State Supervenience[ y^p does not depend on ip, M or U, and by [Ordering! it is a total 
order. It coincides with the original order if 15 £ At. 

This order corresponds to part of Wallace’s Diachronic Consistency axiom, which “rules 
out the possibility of a conflict of interest between an agent and his future selves”, since “an 
agent’s actions take time to carry out.. .if his preferences do not remain consistent over this 
timescale, deliberative action is not possible at all” |Wall2l p.l68]. If the agent prefers, at hrst, 
VU over V'U, but after doing U changes his mind and does V' instead of V, he ends up at 
his initially least preferred result. This part of the axiom needs no justification if one accepts 
[State Superveniencel 


Definition (Null Event). Let E,F £ S, and p £ E he accessible. We say E is null at p if 
U V for any U,V £ Uf such that U\f±- = VIf^ ■ 


So E is null at p if the agent, at this state, is indifferent to what happens on E, as long 
as it does not affect its complement. As we show, this last condition is quite strong in the 
quantum case. Wallace clairn^^ |Wall2l p.l78] nullity has the following properties, for any 
El, E 2 ,E £ £ and any accessible state p £ F: 

Null Subevent. If Ei is null at p, and E 2 C Ei, then E 2 is null at p. 

Null Disjunction. If Ei and E 2 are null at p then Ei V E 2 is null at p. 

These have classical counterparts (Appendix A.3), and the first one is immediate. The 
second seems intuitive: if the agent does not care about Ei or E 2 , he should not care about 
El V E 2 . The classical proof uses the fact that classical acts (functions mapping states to 
payoffs) at an event and at its complement are independent. It does not adapt to the quantum 
case, where acts (unitary maps) at E usually affect E^. In fact, [Null Disjunctionj seems wrong. 

Let M £ M and V, W be acts with V|m-l = Then W — VU, with = Im-l- By 

unitarity, U(M) C M, so U only moves states inside M. As the agent is indifferent to states of 
M, we should have U 1 and W V , for all p. So macrostates are always null, which seems 

fair: M is too coarse if the agent cares about states moving in it. [Null Disjunction] extends 
this to disjunctions of macrostates, which is not plausible, as the agent may care about states 
switching macrostates. This also contradicts the [Nullity Lemmaj which uses the properties. 

The rest of Walace’s Diachronic Consistency is a different idea, which got mixed with it as 
he only defined y^j, at macrostates. We present it as the following axiom. 

Branch Independence. Let E £ £ have a partition E = ViMi with Mi £ M. Given an 
accessible p £ E, and V, V' £ IAe, let Vi = V\Mi, Vl = V'\ Mi and Pi = YiMiP- Then: 

• if Vi Vi for all i with Mi not null at p then V V'; 

• if, in addition, Vi y^^ Vf for some i, with Mi not null at p, then V y^ V'. 

This is reminiscent of the classical [Independence[ condition. It relates the order at a branched 
state to preferences at its branches, when all the agent’s versions agree. If, for all of them, the 

^®In [Wall2l p.l66], Wallace says would be an order on acts at events, but he only defines it at macrostates. 

^^He mentions sets and unions, but it is clear he wants the properties to hold for events and disjunctions. 
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act corresponding to V, via restriction, is preferred, or equivalent, to that given by V', then V 
is preferred, or equivalent, to V'. And it becomes a strict preference if it is so in at least one 
branch that the agent cares about, i.e. non-null. 

Perhaps the condition of Mi not being null at 93 should be replaced by it being nonorthogonal 
to If this means has a branch in Mi, it seems reasonable the agent should care about it. 

This would allow the axiom to be used in the lDominance Lemmal despite the problem with the 
[Nullity Lemma] 

If i? £ Ad (so the Mi’s are a refinement of it), and Vi Vl for some i, it seems the same 
should hold for all i, as all states of E are similar. The axiom implies V >~,p V' , so refinements 
and coarsenings preserve preferences. With the [Orthogonality Condition) different partitions 
of E £ £ have a common refinement, so if the same preference between V and V' is held in 
all branches of a partition, it is also held in any other. So the axiom never leads to conflicting 
results, depending on the partition used. Without that condition, another partition might have 
branches physically very different from the </ 3 i’s, as seen in section [^.1.21 Preferences might be 
reversed in them, leading to V -<,p V' . We note that this problem also happens in Wallace’s 
original formulation. 

Macrostate Indifference. Let M, M', N\, N 2 £ M, ip £ ‘’P' G > U,V £ Um, U',V' £ 

Um' , and ri, r 2 £ TZ. IfOu, Ow C Ni Ari and Oy, Oy' C N 2 Ar 2 , then U V U' V'. 

Wallace’s informal description is that “an agent doesn’t care what the microstate is provided 
it’s within a particular macrostate” [ Wall21 p.170]. Decisions can not depend on microscopic 
details, for the agent has no such fine control, and if two states are different enough to affect 
his preferences, they should be in distinct macrostates. 

If M = M', U = U' and V = V' , we get U V U V , so preferences do not depend 
on the initial microstates. If M = M' and ip = ip', we find that the final ones do not matter 
either. This agrees with the informal description, but the formal statement is much stronger. 

It allows M 7 ^ M', so preferences can not even depend on the initial macrostates. Nor on acts, 
as with Ip = Ip', Ni = N 2 , ri = r 2 , U = V' and V = U' we get that all acts sending ip into 
Ni A ri are equally preferred. Basically, preference between acts, with images in sets of the 
form N Ar, depends only on these. This needs further justification, perhaps along the lines of 
[State Supervenience] 

We note that Wallace’s use of A r is strange. As discussed in section 13.1.31 can some 
microstates of be in r and others not? If so, can the agent distinguish them? 

Wallace says [Wall21 p.180] the axiom is used, with[Reward Availabilitv[and lReward Preservation! 
to define the lOrder on Rewardjf^ . He does not elaborate, but extending the equivalence of pref¬ 
erences from A Ar to the whole r may also need [State Supervenienc^ And, as not all states of 
r might be connected via some available U, it may not be enough. We got that order by other 
means, and did not use the axiom at all. 

Solution Continuity. Let M £ M, ip £ M and U,V £ Um- If U ’^- 4 , V then U' >- 4 , V' for 
any U',V' £ Um sufficiently elose (in the operator norm) to U and V. 

Small perturbations of acts can not alter preferences, if rational decisions are to be possible, 
as agents can not distinguish arbitrarily close acts, nor execute them with microscopic precision. 

But branch discontinuities allow even small changes to create completely new branches, which 
might affect the decision. So lack of fine control could mean rationality is really not possible, 
in accordance with section m 

This axiom is only used in the [Utility Lemma| with [Standard ActsI Restricted to these it 
seems reasonable, except, perhaps, at standard acts of weights 0 or 1 , due to branch disconti- 

^®He also says it could be used to obtain his version of Branching Indifference, if M = £■ 
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nuities. But the agent can never execute standard acts with absolute precision, and even tiny 
deviations could have important consequences. 


3.3 Proof 

We present Wallace’s proof assuming all problems with axioms and concepts can be fixed. It 
was reorganized to highlight parallels with the classical decision under risk fAppendix IA.2II . 
We used a more explicit notation, filled in details and clarified some points, but the idea is 
essentially the same as Wallace’s. Use of axioms is clearly indicated, except for IRestrictionI 
[Indolent |Composition[ and [Ordering which are quite simple and ubiquitous. Here we use 
normalized states to simplify the notation. The reader is invited to check this is nowhere related 
to a probabilistic interpretation. 

The first lemma shows preference on bets (represented by acts Ui, C/ 2 , Ui and V 2 ) depends 
only on the norms of projections of the final states (so, on their Born weights) on reward 
subspaces. It corresponds, classically, to the assumption, expressed in ([9]), that in decisions 
about lotteries all that matters are the probabilities of their rewards. 

Equivalence Lemma. Let Mi, M2 € M, tpi G Mi, ip2 G M2, Ui,Vi G Umi, U2,V2 G Um^, 
and suppose, for all r £ TZ, IHrC/i'i/’il = |nrC/ 2 '!/’ 2 | and |nrVi'!/’i| = \LlrV2'ip2\- Then Ui 
Vi U2 ;^V'2 ^ 2 - 


Proof. We organize the proof in 4 steps. 

Step 1 ) We use branching acts Wi.r,i and W2,r,j to split branches and ip2,r,j of Uiipi 
and U21P2, in reward r, into new ones and (fi2,r,j,i of equal Born weights. 

Let ifi = Ui'ipi, (^2 = U 2 'ip 2 , El = Oui, E 2 = Ou 2 , R = {r £ TZ : Tlr^pi^ 0 }, and, for each 
r £ R, let ipi^r = HrPi, 'P 2 ,r = YiTP 2 , Ei^r = Hr El and E 2 ,r ~ Hr-i/j^T Given a partition 
{Mi^r,i} of Ei^r, let ipi,r,i = Hmi ^ and Ir = {i ■ pi,r,i 0 }. And, for a partition {M2,r,j} 
of E 2 ,r, let ^ 2 ,r,j — n]Vf 2 , 7 .,j ^ 2 ,r and Jr — • ^ 2 ,r,j 7 ^ b}. 

ForiG/randj G Jr,letpr,i = \pi,r,i\^/\'Pi,r\^ wiAqr.j = \(p2,r,j\^/\'P2,r\^ ■ [Branching Availability| 
gives Wi,r,i £ such that G r has a partition {Ni^r,i,j}j£Jr with \(t)i,r,i,j\ = 

\Ti,r,i\'^-qr.j, where (fi,r,i,j = Wi,r.,iV 2 i,r,i. We also get W2,r,j £ UM^.r.j C r 

having a partition {N 2 ,r,i,i}ieir such that \(j)2,r,i,i\'^ = Pr,i-\‘P2,r,3\^, where <j)2,r,i,i = HjVj ■ W2,r,iV22.r.j. 
By hypothesis \‘Pi,r \ = |v22,r'|, so we have 




lyi, r,-i I • \<P2,r,j \ 

\ip2,r\ 


— \4>2,r,j,i\. 


(8) 


For i ^ Ir OT j ^ Jr, let and W 2 ,r,j = ^M2,r,j- 

step 2) The Wi,r.i’s (resp. W2,r,j’s) are combined into Wi (resp. W/i). Irreversibility 
ensures the cj)i,r,i,j’s (resp. 4 ) 2 ,r,j,is) remain as branches of WiUiifi (resp. W 2 U 2 ip 2 ). 


Compatibility! gives Wi G Uei, W 2 £ Ue2 with = Wi,r,i and W 2 \M 2 .r.j = ^ 2 ,r,j. 

By Irreversibility! the Owi r i are mutually orthogonal, as are the Ow 2 r j 80 HjVi r i = 
4>i,r,i,j, and IlN2,r,3,i4>2 = 4’2,r,j,i, where 4>i = Wiifi and 02 = W 2 P 2 . Also, 


Owi = \/ Owi^r.i 

r,i 


V 

r^R 


V 



V 



with a similar decomposition for Ow 2 ■ 

^^The!Orthogonality Condition] ensures HrOij = Ojj Ar, so that Ei^r,E2,r £ £■ 
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step 3) We use erasures to eliminate the differences between 4>i,r,i,j and (f> 2 ,r,j,i, and 
combine them into acts Xi and X 2 snch that XiWiUiijji = X 2 W 2 U 2 'ip 2 - 


By dS}, [Erasure] gives Xi^r,i,j € a^nd X 2 ,r,j,i € HN 2 ,r,j,i with C r 

and Xi^r,i,j4‘i,r,i,j = X2,r,j,i4>2,r,j,i- [Compatibility j gives Xi G l^Owi and X 2 G L^Ow 2 such that 
Xi\Ni,^^ij = Xi^r, i,j and X2\N2,r,i,i = for i G h and j G Jr, for 

i ^ Ir, and X 2 \M 2 ,r,j = ^M 2 ,r,j for 3 ^ Jr- By construction, Xicjn = X 2 <i> 2 - 

Step 4) The agent is indifferent to the branchings W and erasures X. They take the U'tp's 
to the same hnal state, and a similar procedure can be done to the Vip's, so the preferences 
between the U’s and the l^’s must agree. 


By [Branching Indifferenc^ and lErasure Indifferen^ we have TTi,r,i r i r i’ ~¥>2 

tM2,r,j, aud X2.r.j.i ~</.2,r,3,i '^N2^r,j,i- Therefore [Branch Independence 

gives Wi Isi, W2 ~Lp2 1b 2. ^1 ~</>i lowi ^2 ~^2 lowa- 

Let U[ = XiWi and U 2 = X 2 W 2 . Then UiUiipi = U 2 U 2 ^p 2 , and, by the [Extended Orderl 
UiUi Ui and U 2 U 2 ^^2 ^ 2 - Likewise, we can get V{ G Uov^ and V 2 G Uov 2 with VlVixpi = 
V-^V 2 'ip 2 , VlVi Vi and E 2 V 2 V 2 . Finally, [State Supervenience] gives U[Ui E/Vi 4^ 
C/2C/2 y^2 V2V2. □ 


Corollary 1. Let Ei,E2 & £, ifii £ Ei and ip2 G E2 be accessible states, Ui,Vi G Uei, 
and U 2 ,V 2 G Ue 2 - If, for all r £ TZ, \\lrUiip\_\ = \\lrU 2 < 4 > 2 \ and IllrCivsil = |nrV 2 V? 2 |, then 
Ul >~,pi Vl 44, U2 '?~IP2 ^2- 


Corollary 2. Let E £ £, ifi £ E be accessible, and U,V £ Ue- If IllrC/i^l = IIIrEv^l for all 
r £71 then U V. 

This allows us to get a preference on rewards from that on acts, ft might iustifv iRewards as Preferences! 
if not for the use of axioms depending on that characterization. Classically, this order comes 
from identifying rewards with simple lotteries. 


Definition (Order on Rewards). We write r y s, for r, s £ TZ, if U y,i, V for all accessible 
Ip £ E {E £ £) and U,V £ Ue such that Uip £ r and Vip £ s. 


By [Corollary 1 it is enough that U V for some ip,E,U,V satisfying those conditions. 


and by Reward Availability] and [Orderiii^ this is a total order on 7Z. 

As usual classically, Wallace assumes the next result as valid, to have a nontrivial problem. 
But in the quantum case the order on rewards is derived from that on acts, so it is best to get 
its nondegeneracy from the more basic one on acts. 


Reward Nondegeneracy Lemma. There are r,s £7Z such that r y s. 


Proof. Fix s £'1Z, and suppose r ~ s for all r £'1Z. [Act Nondegeneracy] gives M £ Ai, ip £ M 
and U,V £ Um such that U y,/, V. Partitioning each r £7Z into macrostates, let {Mi} be the 
set of all those with ipi — IlMiUip rfz Q. Reward Availability] gives, for each i, some Wi £ Umi 
with Owi C s. The [Order on Rewards! implies Wi 1m . [Compatibility! gives W £ Uou 
such that W|Mi = Wi, and [Branch Independence implies W r^uti, lotj. Then WUip G s, and, 
by the [Extended Order! WU U. Likewise, we obtain W' with W'Vip £ s and W'V V. 
By [Corollary 2[ WU W'V, so that U V, contradicting their choice. □ 


Definition (Extremal Rewards). We fix ro,ri G TZ such that ro A ri and ro r ri for all 
r £TZ. 
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Existence of vq and ri is assured by the previous lemma and the finiteness of TZ. As Wallace 
assumes no such condition, his original proof requires some technical workarounds, which are 
usual in the classical theory. 

The next lemma characterizes a I Null Event I showing the only way an agent will not care 
about an event is if it contains no branches. As discussed before, it may be incorrect. This is 
the only part of the proof whose classical parallel is in the case of decision under uncertainty 
lAppendix lA.3ll . It appears because, instead of lotteries, Wallace uses a framework of acts and 
events, typical of that case. 

Nullity Lemma. E £ £ is null at an accessible state ip if, and only if, HeiP = 0. 

Proof. If HeiP = 0 then U\e± = V\e±- gives Uip = Vip. By [State Supervenience| U V, so 

E is null. 

Now suppose E is null at ip, but \IIeiP\‘^ = a ^ 0. With [Reward Availabilit^ and [^mpatibility[ 
we can construct U,V € Uu such that Ou C ro, V{E) C ri, and E|gj_ = U\e^. As E is null, 

V -^.0 U. Given any other E' and ip' with = a, we can also obtain U',V' € Un 

such that Oui C ro, V'{E') C ri, and V'\eii- = U'\eii-- By | Corollary 1| V' U'. But 
[Branch Independence! and lOrder on RewardsI would imply V' U', unless E' is also null at 

Ip'. This shows that, whenever IIIbV'I = E is null at ip if, and only if, E' is null at ip'. 

Let n € N, with 1/n < a. With [Branching Availability! ^6 obtain <p and U such that 
Ou = M 1 VM 2 VM 3 , \IlM-iU(p\^ = l/n,\llM 2 U(p\ = a — 1/n, and lIlMjt/^l^ = 1 — a. Since 
\1\m-i'<jM2E4>\^ = a, M 1 VM 2 is null at U(p. Bv INull SubevenB so is Mi. Hence any M is null 
at any ip in which IHM'i/’P = 1/n. 

With!Branching Availability! another (p and U with Ou = MiV ... VM„ and \IlMiU(p\^ = 

1/n for all i. As the Mi’s are null at U(p, by [Null Disjunction! so is Ou. [Reward Availability! 
and [Compatibili^ give V,W £ lAou with Ov C ri and Ow C ro. By the lOrder on RewardsI 

V yuct>W, contradicting the nullity of Ou. □ 

We now define the quantum equivalent of the standard lotteries (O. 

Definition (Standard Acts). Let M £ M, ip £ M and 0 < a < 1. A standard act of weight a 
at Ip is an Ua € Um such that Ou^ C roVri and \IlriUaip\^ = ot. 

These can always be obtained by [Branching Availability! [Reward Availability! ^ thi® 
could be done in a way that Ua depends continuously on a (which seems feasible), we might 
no longer need the [Problem Continuity! axiom. 

The next result corresponds to the classical [Monotonicity! 

Dominance Lemma. Let Ua and Up be standard acts of weights a and pi at ip. Then a > 

/5 ^ Ua yip Up. 

Proof. 11 a > P then, using [Branching Availability! we can obtain ip and U such that Ou = 
M 1 VM 2 VM 3 and, letting <pi = LlMiP for (p = Up, 

\(pi\^ = P, \(p2\^=a-P, \<p3\^ = l-a. 

By [Reward Availability! [Compatibility there are V,W £ Uou such that 

E(Mi)cri, E(M 2 )Cri, ^(Ms) C ro, 

IT(Mi) C ri, IT(M 2 ) C ro, IT(M 3 ) C ro. 


The lOrder on RewardsI implies E|mi ~</)i IE|mi, E|m 2 y<p 2 W\m 2 , and E|m 3 ~</)3 IE|m 3 . 


By the Nullity Lemma! M 2 is not null at <p, so 

Branch Independence! gives V ypW. The 

lExtended Orde^implies VU y.n WU. and bv the 

Equivalence Lemma Ua yp. Up. 
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li a = P then Ua ~i/' by [Corollary 2| The converse follows via|Ordering| 


□ 


The next lemma forms an utility function u{r) by comparing standard acts with simple acts 
Vr (which lead to a single r), as in the classical case. 

Utility Lemma. There is an uniau^^ u : 7^ —>■ [0,1], with u{ro) = 0 and u{ri) = 1, such that 
Uu(r) Vt for any r £ TZ, tp M (M £ M), any standard act Uu(r) of weight u{r) at ip, and 
any act Vr with Vrip € r. Moreover, u{r) > u(s) r s. 


Proof. For any M £ Ai, ip £ M, r £ TZ, and 0 < a < 1, there are, by [Branching Availabilityj 
and [Reward Availability a standard act Ua £ Um of weight a sd, ip, and an act Vr £ Um with 
Vrip £ r. By the Equivalence Lemma[ the definition 

u{r) = sup{a : Ua K}, 


does not depend on the choice of ip, Vr or Ua- By construction, Ua y-ip K- if a > u{r), and by 
the lDominance Lemmal f/.v -<.,p K- if a < u{r). 

Suppose C/u(r) Then u(r) 7 ^ 0, or the lOrder on RewardsI would give ro >- r. By 

[Solution Continuity[ U Vr for all [/ in a neighborhood of Uu{r)- For a < u{r) close enough 
to u{r), we can assume Ua is in this neighborhood (if necessary, using [Problem Continuit^^ to 
get Ua as a perturbation of Uu(r)), contradicting Ua -<iii K-. By a similar argument it is not 
possible that Uu(r) Fr- So Uu(r) Fr, and by the [Equivalence Lemma] this holds for any 
other acts with the conditions of the lemma. 

The IDominance Lemmal and the lOrder on Reward imply that u{r) > u{s) 

Uu(s) ^ Vr Va r y s. To prove unicity, let u be another utility function. Then 
Uu(r) Vr Uu(r), and by the lDominance Lemmal ufri = u(r). □ 

The dehnition of expected utility m, from classical decision theory, is adapted with Born 
weights in place of probabilities. 


Definition (Expected Utility). The expected utility of an act U at a state ip is 

EUv,(C/) = ^ IHrUipf -uir). 

r£Tl 

For standard acts, EU(17c,) = a. The next lemma shows any act is equivalent to a standard 
act of equal expected utility. It corresponds to the use of [Substitutabili^ in the proof of the 
Von Neumann-Morgenstern Theorem (IVNMIl . 

Standard Act Lemma. Let M £ M, ip £ M, U £ Um- Then U Ua, where Ua is a 
standard act of weight a = EU.^!,(17) at ip. 


Proof. Let {Mr,i} be a partition of r £ 7?. in macrostates, ip = Uip, and pr.i = LiMriT- 
With [Branching Availabihty[ [Reward Availability[ and [Compatibility[ we get W £ lAn such 
that W\Mr i is a standard act of weight u{r) at if Tr,i 7 ^ 0 , and VF|m,. 1 m,. ^ if Tr.i = 0 . 

Then \L\.r^Wipr,i\^ = u{r) ■ \Tr,i\^, so thalPI 




2 


^ ^ n,.^ Wp>r,i 

r,i 


^ \nr,Wipr,i\' = X] u{r) ■ 


EU^(C/), 


and WU is a standard act of weight EU,/,(17) at ip. As the [Utility Lemrnal implies 1 U|m,. i^Vr i 
1m, [Branch Independence[ and the Extended Orderi give U WU. □ 

^’^Other values of ri(ro) or u{ri) allow for positive afBne transformations, as in the classical theory. 

^^The second equality requires orthogonality, which comes from the use of [Compatibility] to obtain W. 
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Finally, as preference on standard acts increases with their expected utilities, the same holds 
for all other acts, and we obtain the quantum equivalent of IVNMl 

Born Rule Theorem. U y-i/, V 4^ EU^([/) > EU^(F). 

Proof. By the [Standard Act Lemmal U and V are equivalent to standard acts of weights 
EU^(C/) and EU^(E) at ip, so the [Dominance Lemmal gives the result. □ 

Except for the use of Born weights instead of probabilities in EU, this corresponds to the 
|Principle of Maximization of Expected Utility| justifying the |Born Strategy] So, for rational 
decisions in EQM, Born weights work as if they were probabilities. 


4 Comparison with the classical cases 

As the technical details of the proof can obfuscate its idea, we now discuss simple examples in 
comparison with classical decision theory. We take only three rewards ro ^ r ^ ri, and set 
utilities u(ro) = 0 and u(ri) = 1 as reference values. 

1. In the classical decision under risk fAppendix [A.2[| . take standard bets Bp which give 
ri with probability p, otherwise give tq. If the agent’s preference increases continuously 
with p, there will be some Pr for which he will consider Bp,, as good as receiving r with 
certainty. This is taken as a measure of his interest in r, and called a (subjective) utility 
w(r). 

Given a bet A, with probabilities ag, Ur and ai for ro, r and n, let A' be identical to 
it, except that the event that would give r now leads to a secondary bet Bp,.. As the 
agent is indifferent between r and Bp,., the same should hold for A and A'. Since classical 
probability rules imply A' is equivalent to Ba,.p,.+ax , the expected utility arPr + Qi is 
taken as a measure of the agent’s interest in A. 

The example needs the agent to be reasonable enough, with a minimal knowledge of how 
probabilities work, for his preferences to satisfy the axioms of IVNMl 

2. In decision under uncertainty (Appendix [A.3I) . the probabilities are unknown. But if the 
agent’s opinions are reasonable, we can first get a subjective likelihood order on events, 
by checking in which ones he would prefer to receive ri. 

Then suppose we have a collection of events he considers as unlikely as need be, so his 
preferences are little affected by what happens in any of them. We can use, for example, 
roulettes with arbitrarily large numbers of results. Even if the agent thinks they are 
unfair. Savage proved the events can be combined into new ones he will accept as equally 
(un)likely. So we might as well assume he believes in their fairness. For a roulette with 
n results, define standard bets Bm giving ri for the first m results, and rg for the rest. 
As m increases, the rewards of some results improve, so the agent’s preference should 
increase gradually. 

Given any event E, let a bet A give n in case of E and ro otherwise. If the agent 
prefers A to Bm until m reaches a certain value, his subjective probability for E is in 
the interval [—A precise value is obtained via a limiting process. Having his 
subjective probabilities for all events, we proceed as in the previous case. 

All steps work if the agent is reasonable enough, and has enough knowledge of how 
probabilities work, for his preferences to satisfy Savage’s axioms. 
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3. In the quantum case, let a standard quantum bet be one in which the agent measures 
an electron spin, with Born weight w for up, and receives ri in branches of spin up, ro in 
those of spin down. For w' > w, let B' be a bet where, after Bw, his versions which got 
ro measure a spin with weight for up, and in the new up branches the reward is 

improved to ri. The weight of ri is now w', if there is no interference. If the agent does 
not think the new measurement alters the relative relevance of the original ri branches, 
does not mind doing the extra experiment, and wants as much ri as possible, he should 
prefer B' over 

The final state of B' is not the same as if B„/ had been done, due to differences in their 
histories (and maybe in the amount of branches, if it has any meaning). Wallace argues 
they can be erased without affecting the rewards (via unitary acts, so the weights had 
to be the same in both states) or the agent minding it. Then he should be indifferent 
between B' and B^/, and prefer B„/ over B^,. 

Hence his preference for the BZs increases with w. If it does so continuously, for some 
w he will consider B™ as good as receiving r with certainty. This w is taken as the 
(subjective) utility u(r). 

Let a bet A give ro, r and ri according to the result of a quantum measurement of weights 
Wo, Wr and wi. Let A! be similar to A, but in branches where the agent would get r 
he instead performs a secondary bet B^f^^)- Under the same assumptions as before, he 
should be indifferent between A and A'. The weight of ri in A' is Wru{r) +Uii, if there is 
no interference. Erasures allow us to consider A' equivalent to B^^u(r)+'wi , so this weight 
measures the agent’s interest in A. In a parallel with the classical case, Wallace calls it 
an expected utility, and concludes that, for decision purposes, weights play the same role 
as probabilities. 

His axioms try to ensure all assumptions made are valid. But some of them assume a 
knowledge of how branches work that is beyond what we have. 

Comparing the examples, we see the quantum case adapts just the first one. In decision 
under risk, assumptions about the agent’s preferences allow us to infer his subjective utilities 
via probabilities in standard bets, and the result follows from the way classical probabilities 
combine in successive bets. In the quantum case, assumptions about the agent’s preferences 
allow us to infer his subjective utilities via Born weights in standard quantum bets, and the 
result follows from the way Born weights combine in successive branchings, if there is no 
interference. 

Decision under uncertainty has extra steps, to get subjective probabilities, not present in 
Wallace’s case. Comparison of his proof with Appendix IX3l shows he takes from Savage little 
more than the language of events, acts and payoffs. But while Savage sets payoffs apart from 
states of the world, Wallace identifies rewards with sets of states in which they were given. 
Acts, which for Savage are any maps from states to payoffs, become unitary maps on states. 
This affects another concept taken from Savage, nullity, which, as seen, did not adapt well to 
the quantum case. 

It should have been clear, from the start, that the idea that Born weights work as subjective 
probabilities is incorrect. For if all rational agents use the same weights as probabilities, these 
can not be considered subjective (unless this term is taken in its broadest sense, to include all 
we believe to know, which is not the strict sense in which Savage’s subjective probabilities are 
defined). 
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5 Conclusion 


The first reaction to the idea of using decision theory to obtain the Born rule tends to be 
disbelief. But, upon learning that it provides a formal framework for subjective probabilities, 
and studying Deutsch’s proposal, many find it appealing. We found Wallace’s proof to be, for 
the most part, correct, although some details needed fixing. But concepts and axioms upon 
which it is based present problems, and the more we delve into the formal details, the less likely 
it seems that they can be solved. 

By their nature, decision theoretic proofs must assume worlds, reasonably similar to ours, 
emerge from EQM. The details of such hypothesis should be clearly stated. Are all branches like 
this? Are they orthogonal to each other? Is some interference possible? Is the decomposition 
unique? How similar to our reality must these worlds be? If they sustain intelligent life, has 
it evolved to use a reasoning similar to ours, or one better adapted to a branching Universe? 
Validity of the result depends on the answers being properly justified. 

Wallace’s assumptions regarding these questions are usually left implicit in his concepts 
and axioms. He believes branches, resulting from his solution of the preferred basis problem, 
have all the necessary properties. But, as shown in |Manl7| . that solution is not valid, since 
decoherence depends on the probabilistic interpretation of Born weights to justify its approx¬ 
imations. Analysis of what results from combining EQM and the non-probabilistic aspects of 
the decoherent histories formalism shows branches might not be so well behaved. Some of the 
problems might be alleviated with the proposed causal histories formalism, if it can be properly 
developed. 

The main problems found in Wallace’s concepts, axioms and proof are: 


1 . 

2 . 


[Null Disjunctioiil fails in the quantum case, compromising the [Nullity Lemma[ The rest 
of the proof still works, if [Branch Independencej is changed as proposed. 

Reward subspaces have incompatible characterizations. Reward Availability[ and part of 
[Branching Availability and lErasurel need Rewards as Payoffs But Branching Indifference[ 
and [Erasure Indifference depend on [Rewards as Preferences! 


3. Macrostates should emerge from the solution of the preferred basis problem, but Wallace 
is not clear about all properties expected of them. He gives vague ideas, but his examples, 
and the formalization of some axioms, seem to contradict them. It is also not clear if they 
refer only to the system of interest, or if they also specify the macroscopic state of the 
environment and history records. 


4. Wallace assumes mutually orthogonal branches. These are an idealization in the usual 
decoherent histories formalism, but become highly unstable in its non-probabilistic ver¬ 
sion. Realistic branch decompositions are almost orthogonal, but even tiny deviations 
have important consequences, such as: 


• branch decompositions may be non-unique, and not even similar; 

• [Branch Independence[ might lead to contradictory results; 

• there might be macrostates not contained in any reward subspace; 

• not all branches oiUtp might be in Ou, compromising jlrreversibility [ [Reward Availability [ 
lErasurel and IMacrostate Indifference! 


5. 

6 . 
7. 


Branch discontinuities compromise [Solution Continuityj e [Problem Continuity[ 
[Irreversibility[ and lErasur^ rely on conflicting claims about preservation of records. 

The[Bra nching- Consistency Theorem[ does not extend to history algebras. It only justifies 
[Irreversibility if history records are separated in distinct macrostates. 
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8. Typical branches might not be like onr world, and complex structnres needed for decision 
theory might not exist, or not behave as usual. This compromises [Reward Availability| 
[Branching Availabilityl and lErasurel which depend on a narrative in which agents exist, 
and their actions have the expected consequences. 


9. Rationality might be impossible in an Everettian universe, compromising part of the 
justification for [Branching Indifferent and [Solution Continuity[ 

10. Impossibility of branch counting does not justify [Branching Indifference[ as there might 
exist other measures not preserved by branchings. 


11 . 

12 . 


Use of [Reward Availability[ [Branching Availability or lErasur^ at branched states requires 
further justification, which we attempt in our Compatibility[ axiom. 


The formal statements of IMacrostate Indifferencel and [Reward Preservation! (Wallace’s 
Branching Indiference) differ much from what he describes and justihes. 


13. IMacrostate Indifferencel is not needed, and part of Walace’s Diachronic Consistency fol¬ 
lows from [State Supervenienc^ 

14. There are also miscelaneous difficulties withlErasurel IMacrostate Indifferenceland[State Supervenience[ 


Even if all problems can be solved and Wallace’s result is correct, it does not mean Born 
weights are an Everettian agent’s subjective probabilities. His proof is not based on such 
concept, at least not in Savage’s decision theoretic sense. Born weights are objective parameters 
the agent uses to decide, formally in the same way a classical agent uses probabilities. But as 
he knows all branches exist, the meaning he attributes to such weights can not be probabilistic 
(as seen in section 12.31 appeals to self-locating uncertainty do not seem valid). Interpreting 
his reasons, in a possibly quite distinct Universe, in terms of our familiar concepts (call it 
subjective probability, credence or degree of belief), would be a huge extrapolation. 


A Classical decision theory 

Decision Theory [PI09I IKarl4[ is an interdisciplinary area of study, concerned, among other 
things, with developing principles to optimize decision making. Decision problems are usually 
framed in terms of an agent having to establish a preference order >- on lotteries. A lottery 

A = {{pi,ri)}ieo (9) 

consists of a set O of mutually exclusive outcomes, each having probability pi and giving a 
reward ri. 

Each reward r can be identihed with a simple lottery, which gives it with certainty as its 
unique reward. An order on lotteries induces another on rewards. If all rewards are equally 
preferred, the problem is trivial. Otherwise, we assum43 the existence of least and most 
preferred rewards, labeled as ro and ri. 

Given lotteries A and B, and t G [0,1], a compound lottery tA-\-{l — t)B is one in which 
the probability of each reward is tpA -I- (1 — t)pB, where pA and ps are its probabilities in A 
and B. Equivalently, it is a lottery in which the agent first chooses randomly to bet on A or 
B, with probabilities t and 1—t respectively. For each t £ [0,1] we dehne a standard lottery of 
weight t as 

Lt = {I— t)ro+ tri. (10) 

^^This is just for simplicity, the theory can be adapted to work without such extremal rewards. 
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A.l Decision under certainty 

In Decision under Certainty, each lottery has a unique outcome. The agent’s preference is 
rational (in a strictly decision theoretic sense) if it is a total order, i.e. it satisfies, for any 
lotteries A, B, and C: 

Completeness. Either Ay B, or B y A, or A ^ B. 

Transitivity. If A B and B ip C then A)p C. 

A.2 Decision under risk 

In Decision under Risk, lotteries have multiple outcomes, with known probabilities. A crude 
strategy uses the Expected Value of lotteries, i.e. the average value of the rewards, weighted 
by probabilities. This seems reasonable, by the Law of Large Numbers, but can lead to unrea¬ 
sonable results, like the St. Petersburg Paradox, when the number of runs is finite. A more 
flexible strategy, proposed in 1738 by D. Bernoulli | Ber54 |. replaces monetary values by an 
utility function u(r) on rewards, leading to the following strategy. 

Principle of Maximization of Expected Utility. The preference order is induced by u{r) 
according to Ay B ^ EU(A) > EU(i3), where the expected utility EU of A = {{pi,ri)}i^o is 

EU(A) = (11) 

ieo 

Given a preference order and an utility function u, y is induced by u if, and only if, the 
following Axioms of Utility are satished, for any rewards r, r' and r"-. 

Axiom Ul. If r y r' then u{r) > u{r'), and if r ^ r' then u{r) = u{r'). 

Axiom U2. If r" ^ tr -\- {1 — t)r' then u{r") = t ■ u{r) -|- (1 — t) • u(r'). 

Any which is induced by some u satisfies |Completeness| [Transitivity[ and also: 

Independence. If Ay B then tA + {1 — t)C y tB + (1 — t)C, for any t € (0,1]. 

Archimedean Property. If Ay B y C, there are s,t G (0,1) such that tA + (1 — t)C y B y 
sA-b (1 - s)^. 

[Independence! means a preference between compound lotteries is based only on components 
that are different. By the [Archimedean Property! no lottery is so incommensurately better 
(resp. worse) than other, that it is impossible to reverse preferences by compounding it ap¬ 
propriately with a worse (resp. better) one. It also means that sufficiently small changes in a 
lottery do not alter significantly the order. 

These four properties dehne rationality for Decision under Risk, and imply the following 
ones: 

Substitutability. If A ^ B then tA + {1 — t)C ^ tB -|- (1 — t)C, for any t € [0,1]; 

Monotonicity. If Ay B then tA-\- (1 — t)B is more preferable for higher values of t; 

Continuity. If A y B y C there is a unique t £ (0,1) such that B tA + {1 — t)C. 

The importance of the [Principle of Maximization of Expected Utility! became clear in 1944, 
when Von Neumann and Morgenstern IVNM44| proved that any rational preference order can 
be obtained from it. 
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Von Neumann-Morgenstern Theorem (VNM). If a preference order y satisfies \Completeness\ 
I Transitivity^ \Independence\ and the\Archimedean Property\ it can be induced by an utility func- 

iioS 


Proof. |Completeness|and|Transitivity|mean lotteries are ordered in a single chain without loops, 


and, by the Archimedean Property no lottery is infinitely better or worse than other. So ^ 


can be described by a correspondence between lotteries and real numbers. By |Monotonicit}^ 
preference on standard lotteries Lt increases with t. By |Continuity[ for any reward r there is 
a unique t £ [0,1] such that r Lt, and we set u{r) = t. As each reward r of a lottery A is 
equivalent to Lu(r-), by lSubstitutabilft^ A is equivalent to Leu{A)- Thus, preference on lotteries 
increases with their Expected Utilities. □ 


A.3 Decision under uncertainty 


In Decision under Uncertainty, there are multiple possible outcomes, but their probabilities 
are unknown. Instead of lotteries, problems are usually framed in terms of a preference order 
on acts which, depending on possible states of the world, will lead to payoffs (consequences). 

Some decision strategies {maximax, maximin, minimax regret, etc.) focus on best or worst 
cases. These are useful in some situations, but can lead to absurd results in others, as they 
disregard important data, like non-extremal payoffs, or the low likelihoods of some states of 
the world. 

Another strategy uses subjective probabilities, estimates by the agent of the likelihoods of 
states of the world. But the decision can be very sensitive to the estimates used, especially if 
there are unlikely states giving huge payoffs. Despite this, L. Savage |Sav72 | proved, in 1954, 
that any preference order satisfying certain axioms can be induced, as before, by some utility 
function and subjective probabilities. 

Let <S be the set of states of the world. Not all details of a state are relevant, so we consider 
events, subsets of states with some common characteristic. For example, the event E = “it rains 
tomorrow” consists of all states in which this happens. Events can be partitioned into smaller 
ones, e.g. E could have subevents “it rains tomorrow and stock prices go up” and “it rains 
tomorrow and the result of a die is 3”. They can be combined using logical operators such as 


AND, OR, NOT. 

An act is a function f ■. S ^ V, where V is the set of payoffs, so that f{s) = x means x is 
the payoff resulting from act / if the state of world happens to be s. Each payoff x is identified 
with a simple act, which results x for all states. Given acts / and g, and an event E, we define 
a new act [/, E-,g] by 


[f.E-,g]{s) 


f{s) if s G E, 
g{s) if s ^ E. 


Savage adopts, for a preference order on acts, the following axioms, which are usually 
taken as mandates of rationality for decision under uncertainty. 


Axiom SI (Nondegeneracy). There are payoffs x, y such that x y y. 


Axiom S2. ^ is complete and transitive. 


Axiom S3 (Sure-Thing Principle). For any event E and acts f,g,h and k, we have 
[f,E-,h]^[g,E-,h]^[f,E-k]ip[g,E-,k]. 

^^Unique up to positive affine transformations u au ^ b, for real constants a > 0 and b. 
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This is similar to |Independence] preference between acts depends only on the events where 
they differ. It allows us to define, for each event E, a conditional preference order by 

E 


f y g and only if [/, E-, h] y [g, E\ h\ for all h, 

meaning “/ is preferable to g given E”, i.e. if the agent assumes E will happen he will prefer 
/ to g- 

An event E is null if f g for all / and g. The idea is that the agent does not believe E 
can happen, so he does not care about the acts on E. Nullity has the following properties, for 
events E and F-. 

Classical Null Subevent. If E is null, and E G E, then E is null. 

Classical Null Disjunction. If E and F are null then E U F is null. 

They result from properties of sets and functions, like the fact that a function can be 
redefined in a subset, without affecting the complement. We prove the last one, for comparison 
with the quantum case. 

Proof. Given any acts /, g and h, define a new act k by 


f5(s) 

if s G S, 

Ha) = < /(s) 

if s^F-E, 

U(«) 

if si EVJF. 


As E is null, [/, E U F-,h] ~ fc, and as F is null, k ~ [g, E U F-, h]. The result follows from 

[H □ 

Axiom S4. For any payoffs x and y, and any non null event E, we have x y y x y y. 

Hence the agent’s preference on payoffs does not depend on which event leads to them. 

Axiom S5. Let x,y,z and w be payoffs, and E,F be events. If x y y and z y w, then 
[x,E;y] y [x,F;y] [z,E-w] y [z,F;w]. 

The intuitive idea is as follows, [a:, E-, y] gives a better payoff in case of E than otherwise. 
\x, F\ y\ gives the same payoffs, but the event giving the good payoff is F instead of E. Bv lS4l 
the agent only cares about the payoff and not how he got it, so the only reason for preferring 
[x,E-,y] to \x,F\y\ is that he thinks E is more likely than F. So the same preference should 
hold for any pair «, w of good/bad payoffs. 

This induces an order on events by 

E y F if and only if \x, E\y] y [x,F\y\ whenever x y y. 

It means the agent thinks E is more likely than F, as he prefers the better reward be given in 
case of E than in case of F. Savage proves this order is a qualitative probability (see |Sav72 | 
for definition and proof) on events. The next axiom allows it to turn into a quantitative one. 

Axiom S6. If f y g then for any payoff x there is a finite partition S = such that 

[x, Ei- f]y g and f y [x, Ee, g] for every i. 

This is similar to the [Archimedean Property] It means the set of states of the world can 
be decomposed in events Ei deemed so unlikely that changing the payoff on one of them is 
not enough to alter preferences. And no payoff is infinitely good or bad, or it would alter 
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preferences for any non null Ei. The partition could be given, for example, by the results of a 
roulette with a huge number of slots. 

Savage extends this to events, proving that if F >- E there is a finite partition S = U"^iEi 
such that F >- E U Ei for any i. Some Ei’s may be deemed more likely than others, but, after 
some technical work, he obtains partitions whose events the agent considers equally likely. 
Then it is straightforward to obtain probabilities. Partitioning S into n equally likely events, 
we attribute to each a subjective probability i. If, for a large n, the minimum number of these 
pieces needed to cover some event E is m, its subjective probability should be close to ■^. The 
exact value is defined via a limiting process. 

Once we have the agent’s subjective probabilities, the problem is reduced to a decision 
under risk. 

A hnal axiom is needed for situations with infinitely many payoffs. It requires that if / is 
preferred, given E, to all payoffs g can give in such event, then / is preferred to g given E. 


Axiom S7. If f >~ g{s) for any s £ E, then f ^ g (and likewise for -<). 
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